-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Doc] Add missing classes to doc #1203
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Feb 3, 2025
ghstack-source-id: 2e174577aa33cc8d69c0f423c90ea2e5ee0fdef6 Pull Request resolved: #1203
vmoens
added a commit
that referenced
this pull request
Feb 3, 2025
ghstack-source-id: 2e174577aa33cc8d69c0f423c90ea2e5ee0fdef6 Pull Request resolved: #1203
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 67.4360μs | 21.2185μs | 47.1286 KOps/s | 48.6519 KOps/s | |
test_plain_set_stack_nested | 56.0350μs | 21.3564μs | 46.8243 KOps/s | 47.7523 KOps/s | |
test_plain_set_nested_inplace | 0.1064ms | 23.3333μs | 42.8572 KOps/s | 44.5441 KOps/s | |
test_plain_set_stack_nested_inplace | 59.3410μs | 23.2162μs | 43.0733 KOps/s | 44.5772 KOps/s | |
test_items | 41.5380μs | 4.1647μs | 240.1158 KOps/s | 244.0307 KOps/s | |
test_items_nested | 0.7465ms | 0.4106ms | 2.4353 KOps/s | 2.4483 KOps/s | |
test_items_nested_locked | 0.8650ms | 0.4113ms | 2.4313 KOps/s | 2.4520 KOps/s | |
test_items_nested_leaf | 0.1571ms | 76.8573μs | 13.0111 KOps/s | 12.9647 KOps/s | |
test_items_stack_nested | 0.6140ms | 0.4109ms | 2.4339 KOps/s | 2.3473 KOps/s | |
test_items_stack_nested_leaf | 0.1570ms | 78.2684μs | 12.7765 KOps/s | 12.6667 KOps/s | |
test_items_stack_nested_locked | 0.5762ms | 0.4140ms | 2.4152 KOps/s | 2.4260 KOps/s | |
test_keys | 31.4290μs | 3.7320μs | 267.9516 KOps/s | 281.1827 KOps/s | |
test_keys_nested | 0.2853ms | 0.1647ms | 6.0731 KOps/s | 6.0308 KOps/s | |
test_keys_nested_locked | 1.7808ms | 0.1716ms | 5.8265 KOps/s | 5.8217 KOps/s | |
test_keys_nested_leaf | 0.2609ms | 0.1440ms | 6.9425 KOps/s | 6.9411 KOps/s | |
test_keys_stack_nested | 0.3060ms | 0.1654ms | 6.0443 KOps/s | 6.0791 KOps/s | |
test_keys_stack_nested_leaf | 0.2258ms | 0.1427ms | 7.0065 KOps/s | 6.9797 KOps/s | |
test_keys_stack_nested_locked | 0.2330ms | 0.1720ms | 5.8144 KOps/s | 5.8349 KOps/s | |
test_values | 14.0862μs | 1.0307μs | 970.1922 KOps/s | 887.8213 KOps/s | |
test_values_nested | 0.1200ms | 62.5467μs | 15.9881 KOps/s | 16.2377 KOps/s | |
test_values_nested_locked | 0.1185ms | 62.5316μs | 15.9919 KOps/s | 16.3723 KOps/s | |
test_values_nested_leaf | 0.1303ms | 71.7894μs | 13.9296 KOps/s | 14.1682 KOps/s | |
test_values_stack_nested | 0.1102ms | 63.2012μs | 15.8225 KOps/s | 15.3680 KOps/s | |
test_values_stack_nested_leaf | 0.1335ms | 71.4954μs | 13.9869 KOps/s | 14.2818 KOps/s | |
test_values_stack_nested_locked | 0.1498ms | 63.8239μs | 15.6681 KOps/s | 16.1445 KOps/s | |
test_membership | 28.4940μs | 0.8850μs | 1.1300 MOps/s | 1.1530 MOps/s | |
test_membership_nested | 38.0410μs | 2.8823μs | 346.9402 KOps/s | 347.0271 KOps/s | |
test_membership_nested_leaf | 51.3660μs | 2.8687μs | 348.5874 KOps/s | 349.5149 KOps/s | |
test_membership_stacked_nested | 34.2840μs | 2.8837μs | 346.7762 KOps/s | 350.6693 KOps/s | |
test_membership_stacked_nested_leaf | 57.6780μs | 2.8641μs | 349.1478 KOps/s | 344.5475 KOps/s | |
test_membership_nested_last | 27.4410μs | 4.3374μs | 230.5508 KOps/s | 231.7429 KOps/s | |
test_membership_nested_leaf_last | 50.3940μs | 4.3097μs | 232.0342 KOps/s | 230.8129 KOps/s | |
test_membership_stacked_nested_last | 36.2480μs | 4.3118μs | 231.9192 KOps/s | 224.7012 KOps/s | |
test_membership_stacked_nested_leaf_last | 24.0650μs | 4.3192μs | 231.5255 KOps/s | 231.7342 KOps/s | |
test_nested_getleaf | 57.8680μs | 10.7513μs | 93.0124 KOps/s | 95.5994 KOps/s | |
test_nested_get | 59.4610μs | 10.2048μs | 97.9936 KOps/s | 99.6889 KOps/s | |
test_stacked_getleaf | 31.7090μs | 10.6609μs | 93.8008 KOps/s | 95.6049 KOps/s | |
test_stacked_get | 62.3770μs | 10.1373μs | 98.6460 KOps/s | 99.5982 KOps/s | |
test_nested_getitemleaf | 75.5510μs | 11.2736μs | 88.7029 KOps/s | 90.4234 KOps/s | |
test_nested_getitem | 41.0470μs | 10.6974μs | 93.4805 KOps/s | 94.5818 KOps/s | |
test_stacked_getitemleaf | 68.9590μs | 11.3653μs | 87.9868 KOps/s | 89.3894 KOps/s | |
test_stacked_getitem | 46.8880μs | 10.7805μs | 92.7598 KOps/s | 95.0249 KOps/s | |
test_lock_nested | 0.5836ms | 0.4186ms | 2.3887 KOps/s | 2.4267 KOps/s | |
test_lock_stack_nested | 0.7270ms | 0.4282ms | 2.3352 KOps/s | 2.3625 KOps/s | |
test_unlock_nested | 0.7106ms | 0.3417ms | 2.9267 KOps/s | 2.9539 KOps/s | |
test_unlock_stack_nested | 0.5069ms | 0.3434ms | 2.9118 KOps/s | 2.9169 KOps/s | |
test_flatten_speed | 0.1818ms | 0.1009ms | 9.9069 KOps/s | 10.1416 KOps/s | |
test_unflatten_speed | 0.6470ms | 0.5311ms | 1.8830 KOps/s | 1.9340 KOps/s | |
test_common_ops | 4.4996ms | 0.8548ms | 1.1698 KOps/s | 1.2367 KOps/s | |
test_creation | 66.3440μs | 2.4806μs | 403.1217 KOps/s | 408.4446 KOps/s | |
test_creation_empty | 46.8880μs | 12.9864μs | 77.0038 KOps/s | 86.1836 KOps/s | |
test_creation_nested_1 | 44.9750μs | 16.1413μs | 61.9530 KOps/s | 68.3311 KOps/s | |
test_creation_nested_2 | 51.5160μs | 20.5414μs | 48.6821 KOps/s | 52.8490 KOps/s | |
test_clone | 73.5370μs | 13.4783μs | 74.1936 KOps/s | 75.0471 KOps/s | |
test_getitem[int] | 0.8094ms | 12.8208μs | 77.9985 KOps/s | 76.4474 KOps/s | |
test_getitem[slice_int] | 0.1429ms | 23.9744μs | 41.7112 KOps/s | 41.4521 KOps/s | |
test_getitem[range] | 0.2071ms | 50.5488μs | 19.7829 KOps/s | 20.2363 KOps/s | |
test_getitem[tuple] | 0.1276ms | 19.9784μs | 50.0541 KOps/s | 50.3144 KOps/s | |
test_getitem[list] | 0.2176ms | 46.5444μs | 21.4849 KOps/s | 21.9073 KOps/s | |
test_setitem_dim[int] | 60.8540μs | 26.2365μs | 38.1149 KOps/s | 38.4120 KOps/s | |
test_setitem_dim[slice_int] | 97.0010μs | 51.7186μs | 19.3354 KOps/s | 19.0132 KOps/s | |
test_setitem_dim[range] | 0.1236ms | 76.5111μs | 13.0700 KOps/s | 13.0251 KOps/s | |
test_setitem_dim[tuple] | 77.3440μs | 40.5831μs | 24.6408 KOps/s | 24.7806 KOps/s | |
test_setitem | 0.1246ms | 21.3761μs | 46.7813 KOps/s | 48.6576 KOps/s | |
test_set | 99.8070μs | 20.8180μs | 48.0354 KOps/s | 49.7222 KOps/s | |
test_set_shared | 0.4556ms | 0.1828ms | 5.4707 KOps/s | 5.5259 KOps/s | |
test_update | 0.2182ms | 24.4956μs | 40.8237 KOps/s | 44.0656 KOps/s | |
test_update_nested | 0.1289ms | 34.8272μs | 28.7132 KOps/s | 30.5965 KOps/s | |
test_update__nested | 0.4521ms | 34.0683μs | 29.3528 KOps/s | 29.7687 KOps/s | |
test_set_nested | 0.1041ms | 23.3700μs | 42.7899 KOps/s | 45.3060 KOps/s | |
test_set_nested_new | 0.1189ms | 28.1582μs | 35.5137 KOps/s | 37.5749 KOps/s | |
test_select | 0.1272ms | 44.8550μs | 22.2941 KOps/s | 23.1171 KOps/s | |
test_select_nested | 0.1487ms | 63.1034μs | 15.8470 KOps/s | 15.8927 KOps/s | |
test_exclude_nested | 0.1525ms | 80.8127μs | 12.3743 KOps/s | 12.2587 KOps/s | |
test_empty[True] | 0.5793ms | 0.4119ms | 2.4275 KOps/s | 2.4626 KOps/s | |
test_empty[False] | 13.0343μs | 1.3879μs | 720.5083 KOps/s | 728.1549 KOps/s | |
test_unbind_speed | 0.4827ms | 0.2707ms | 3.6947 KOps/s | 3.6706 KOps/s | |
test_unbind_speed_stack0 | 0.4108ms | 0.2713ms | 3.6858 KOps/s | 3.7078 KOps/s | |
test_unbind_speed_stack1 | 0.1082s | 0.7533ms | 1.3275 KOps/s | 1.2312 KOps/s | |
test_split | 0.1115s | 1.7716ms | 564.4536 Ops/s | 570.5434 Ops/s | |
test_chunk | 0.1137s | 1.7731ms | 563.9827 Ops/s | 630.0014 Ops/s | |
test_consolidate_njt[False-None] | 8.6753ms | 8.2821ms | 120.7427 Ops/s | 111.1239 Ops/s | |
test_creation[device0] | 0.2715ms | 93.9764μs | 10.6410 KOps/s | 10.9895 KOps/s | |
test_creation_from_tensor | 3.8526ms | 98.0890μs | 10.1948 KOps/s | 10.3412 KOps/s | |
test_add_one[memmap_tensor0] | 0.2126ms | 4.7223μs | 211.7622 KOps/s | 201.3600 KOps/s | |
test_contiguous[memmap_tensor0] | 19.4370μs | 0.5174μs | 1.9327 MOps/s | 1.9020 MOps/s | |
test_stack[memmap_tensor0] | 35.1360μs | 3.3440μs | 299.0474 KOps/s | 288.5775 KOps/s | |
test_memmaptd_index | 0.3347ms | 0.2269ms | 4.4073 KOps/s | 4.2660 KOps/s | |
test_memmaptd_index_astensor | 1.0505ms | 0.3170ms | 3.1547 KOps/s | 3.1378 KOps/s | |
test_memmaptd_index_op | 0.8305ms | 0.6019ms | 1.6613 KOps/s | 1.7278 KOps/s | |
test_serialize_model | 0.2307s | 0.1359s | 7.3607 Ops/s | 8.7710 Ops/s | |
test_serialize_model_pickle | 0.4669s | 0.3965s | 2.5218 Ops/s | 2.5798 Ops/s | |
test_serialize_weights | 0.1201s | 0.1160s | 8.6226 Ops/s | 9.0035 Ops/s | |
test_serialize_weights_returnearly | 0.2063s | 0.1692s | 5.9088 Ops/s | 6.5400 Ops/s | |
test_serialize_weights_pickle | 1.0048s | 0.7426s | 1.3467 Ops/s | 2.5762 Ops/s | |
test_serialize_weights_filesystem | 0.1515s | 0.1407s | 7.1095 Ops/s | 6.8488 Ops/s | |
test_serialize_model_filesystem | 0.2544s | 0.1628s | 6.1419 Ops/s | 6.4755 Ops/s | |
test_reshape_pytree | 73.4780μs | 26.5161μs | 37.7129 KOps/s | 37.6229 KOps/s | |
test_reshape_td | 0.1089ms | 33.2962μs | 30.0334 KOps/s | 30.9705 KOps/s | |
test_view_pytree | 93.8120μs | 26.4657μs | 37.7848 KOps/s | 38.0596 KOps/s | |
test_view_td | 89.6780μs | 38.8996μs | 25.7072 KOps/s | 26.2201 KOps/s | |
test_unbind_pytree | 98.9190μs | 30.1253μs | 33.1947 KOps/s | 34.2877 KOps/s | |
test_unbind_td | 0.3598ms | 40.1640μs | 24.8979 KOps/s | 25.2412 KOps/s | |
test_split_pytree | 72.0050μs | 29.9569μs | 33.3813 KOps/s | 34.5501 KOps/s | |
test_split_td | 0.5125ms | 44.4959μs | 22.4740 KOps/s | 21.9944 KOps/s | |
test_add_pytree | 74.3090μs | 35.6857μs | 28.0225 KOps/s | 27.7988 KOps/s | |
test_add_td | 0.1650ms | 59.8030μs | 16.7216 KOps/s | 18.1608 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1712ms | 67.7058μs | 14.7698 KOps/s | 15.3178 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4167ms | 0.1761ms | 5.6781 KOps/s | 5.8407 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1039ms | 45.8763μs | 21.7977 KOps/s | 22.0388 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2688ms | 0.1187ms | 8.4219 KOps/s | 8.5317 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 67.5370μs | 29.1975μs | 34.2495 KOps/s | 35.0789 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1551ms | 59.0562μs | 16.9330 KOps/s | 17.0912 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1769ms | 79.9373μs | 12.5098 KOps/s | 12.4876 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1562ms | 67.4909μs | 14.8168 KOps/s | 14.9376 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2376ms | 0.1070ms | 9.3419 KOps/s | 9.5140 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3695ms | 0.2223ms | 4.4987 KOps/s | 4.6393 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1454ms | 47.6670μs | 20.9789 KOps/s | 21.5194 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1921ms | 67.7807μs | 14.7535 KOps/s | 14.8952 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2469ms | 0.1004ms | 9.9616 KOps/s | 10.1019 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.2960ms | 0.2021ms | 4.9482 KOps/s | 5.0183 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3835ms | 0.2383ms | 4.1968 KOps/s | 4.2517 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1900ms | 0.1068ms | 9.3601 KOps/s | 9.3363 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1463ms | 62.5431μs | 15.9890 KOps/s | 15.7196 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1266ms | 47.9809μs | 20.8416 KOps/s | 20.6355 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2843ms | 0.1582ms | 6.3224 KOps/s | 6.4072 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1858ms | 0.1000ms | 9.9958 KOps/s | 9.8514 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 84.1670μs | 21.9930μs | 45.4690 KOps/s | 45.4162 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1265ms | 67.8311μs | 14.7425 KOps/s | 14.9911 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1628ms | 82.2031μs | 12.1650 KOps/s | 12.2820 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1338ms | 68.6807μs | 14.5601 KOps/s | 14.6224 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4388ms | 0.2174ms | 4.6003 KOps/s | 4.6915 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.1100ms | 1.4029ms | 712.8083 Ops/s | 708.3098 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4059ms | 0.2088ms | 4.7896 KOps/s | 4.7650 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0017ms | 0.8147ms | 1.2275 KOps/s | 1.2042 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.9339ms | 0.4536ms | 2.2046 KOps/s | 2.2229 KOps/s | |
test_compile_assign_and_add_stack[eager] | 5.8105ms | 2.8296ms | 353.4125 Ops/s | 366.3345 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1013ms | 38.9851μs | 25.6508 KOps/s | 25.7857 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5791ms | 32.2219μs | 31.0348 KOps/s | 30.0594 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1014ms | 31.9556μs | 31.2935 KOps/s | 31.8478 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 92.6630μs | 24.6557μs | 40.5586 KOps/s | 42.5611 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1062ms | 32.3315μs | 30.9296 KOps/s | 31.2424 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 72.0040μs | 23.0728μs | 43.3411 KOps/s | 43.6160 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1323ms | 53.2760μs | 18.7702 KOps/s | 18.8844 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.8051ms | 19.2828μs | 51.8598 KOps/s | 48.1745 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1233ms | 46.3516μs | 21.5742 KOps/s | 21.6298 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 75.0300μs | 18.7279μs | 53.3962 KOps/s | 54.1310 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1273ms | 46.8540μs | 21.3429 KOps/s | 21.1771 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 77.5550μs | 18.4050μs | 54.3332 KOps/s | 53.8289 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1475ms | 54.7452μs | 18.2664 KOps/s | 18.3569 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9915ms | 19.5674μs | 51.1055 KOps/s | 48.9521 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1101ms | 47.1285μs | 21.2186 KOps/s | 21.3417 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 91.0210μs | 18.6655μs | 53.5749 KOps/s | 54.4916 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1171ms | 47.7954μs | 20.9225 KOps/s | 21.2940 KOps/s | |
test_compile_indexing[int-pytree-eager] | 80.2700μs | 18.4371μs | 54.2384 KOps/s | 53.9799 KOps/s | |
test_mod_add[eager] | 94.6770μs | 35.4856μs | 28.1804 KOps/s | 27.0570 KOps/s | |
test_mod_add[compile] | 0.1544ms | 64.3241μs | 15.5463 KOps/s | 15.4251 KOps/s | |
test_mod_add[compile-overhead] | 0.1446ms | 64.3472μs | 15.5407 KOps/s | 15.6112 KOps/s | |
test_mod_wrap[eager] | 0.3765ms | 0.2265ms | 4.4144 KOps/s | 4.3111 KOps/s | |
test_mod_wrap[compile] | 1.9487ms | 0.2303ms | 4.3427 KOps/s | 4.1947 KOps/s | |
test_mod_wrap[compile-overhead] | 0.5075ms | 0.2278ms | 4.3893 KOps/s | 4.3120 KOps/s | |
test_mod_wrap_and_backward[eager] | 36.6641ms | 14.6770ms | 68.1336 Ops/s | 88.3693 Ops/s | |
test_mod_wrap_and_backward[compile] | 17.0391ms | 11.8608ms | 84.3113 Ops/s | 89.8701 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.0311ms | 11.2500ms | 88.8888 Ops/s | 87.8183 Ops/s | |
test_seq_add[eager] | 0.2514ms | 0.1176ms | 8.5048 KOps/s | 8.2567 KOps/s | |
test_seq_add[compile] | 0.1577ms | 78.4595μs | 12.7454 KOps/s | 13.1381 KOps/s | |
test_seq_add[compile-overhead] | 0.1901ms | 76.4119μs | 13.0870 KOps/s | 13.2377 KOps/s | |
test_seq_wrap[eager] | 0.7222ms | 0.4533ms | 2.2059 KOps/s | 2.1905 KOps/s | |
test_seq_wrap[compile] | 0.3813ms | 0.2460ms | 4.0658 KOps/s | 3.9882 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4611ms | 0.2459ms | 4.0661 KOps/s | 4.0168 KOps/s | |
test_func_call_runtime[False-eager] | 0.7387ms | 0.5544ms | 1.8037 KOps/s | 1.7451 KOps/s | |
test_func_call_runtime[False-compile] | 0.5605ms | 0.4456ms | 2.2440 KOps/s | 2.1714 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5903ms | 0.4478ms | 2.2331 KOps/s | 2.1576 KOps/s | |
test_func_call_runtime[True-eager] | 1.0415ms | 0.7717ms | 1.2958 KOps/s | 1.2784 KOps/s | |
test_func_call_runtime[True-compile] | 0.7839ms | 0.4685ms | 2.1345 KOps/s | 2.0588 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8718ms | 0.4718ms | 2.1196 KOps/s | 2.0830 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.3101ms | 0.5660ms | 1.7668 KOps/s | 1.7867 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.7357ms | 0.4493ms | 2.2255 KOps/s | 2.1591 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6205ms | 0.4482ms | 2.2312 KOps/s | 2.1941 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4763ms | 0.9141ms | 1.0940 KOps/s | 1.0815 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.2509ms | 0.8125ms | 1.2308 KOps/s | 1.1994 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2265ms | 0.8159ms | 1.2257 KOps/s | 1.2070 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.7430ms | 1.9421ms | 514.8972 Ops/s | 501.3248 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0938ms | 0.5420ms | 1.8449 KOps/s | 1.7816 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8886ms | 0.5419ms | 1.8453 KOps/s | 1.8134 KOps/s | |
test_distributed | 0.4531ms | 0.1255ms | 7.9674 KOps/s | 7.6611 KOps/s | |
test_tdmodule | 0.1188ms | 27.1940μs | 36.7728 KOps/s | 36.8466 KOps/s | |
test_tdmodule_dispatch | 93.4250μs | 49.6220μs | 20.1523 KOps/s | 20.4397 KOps/s | |
test_tdseq | 69.3100μs | 30.1790μs | 33.1357 KOps/s | 33.7868 KOps/s | |
test_tdseq_dispatch | 94.0760μs | 56.0739μs | 17.8336 KOps/s | 18.0889 KOps/s | |
test_instantiation_functorch | 1.7698ms | 1.5333ms | 652.1672 Ops/s | 639.5068 Ops/s | |
test_exec_functorch | 0.2912ms | 0.1805ms | 5.5414 KOps/s | 5.5486 KOps/s | |
test_exec_functional_call | 0.3298ms | 0.1726ms | 5.7924 KOps/s | 5.8387 KOps/s | |
test_exec_td_decorator | 0.5455ms | 0.2324ms | 4.3020 KOps/s | 4.2458 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8726ms | 0.6696ms | 1.4933 KOps/s | 1.4814 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.3254ms | 0.6836ms | 1.4629 KOps/s | 1.4588 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7364ms | 0.5412ms | 1.8477 KOps/s | 1.8327 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8099ms | 0.5429ms | 1.8420 KOps/s | 1.8172 KOps/s | |
test_to_module_speed[True] | 2.0167ms | 1.3591ms | 735.8046 Ops/s | 743.6681 Ops/s | |
test_to_module_speed[False] | 1.8273ms | 1.3131ms | 761.5482 Ops/s | 752.2260 Ops/s | |
test_tc_init | 95.7400μs | 49.2575μs | 20.3015 KOps/s | 20.9211 KOps/s | |
test_tc_init_nested | 0.2005ms | 97.6750μs | 10.2380 KOps/s | 10.6874 KOps/s | |
test_tc_first_layer_tensor | 23.4140μs | 1.6139μs | 619.6065 KOps/s | 636.4004 KOps/s | |
test_tc_first_layer_nontensor | 45.6250μs | 4.7803μs | 209.1898 KOps/s | 212.6301 KOps/s | |
test_tc_second_layer_tensor | 27.2610μs | 2.9804μs | 335.5275 KOps/s | 342.7123 KOps/s | |
test_tc_second_layer_nontensor | 30.7580μs | 6.1444μs | 162.7507 KOps/s | 164.3015 KOps/s | |
test_unbind | 0.2512s | 14.2587ms | 70.1327 Ops/s | 74.3232 Ops/s | |
test_full_like | 10.0138ms | 8.6963ms | 114.9914 Ops/s | 109.8619 Ops/s | |
test_zeros_like | 8.6451ms | 4.8116ms | 207.8311 Ops/s | 315.1094 Ops/s | |
test_ones_like | 5.2104ms | 3.9040ms | 256.1467 Ops/s | 269.2782 Ops/s | |
test_clone | 6.7007ms | 5.7041ms | 175.3128 Ops/s | 178.5024 Ops/s | |
test_squeeze | 0.1083ms | 12.3872μs | 80.7282 KOps/s | 78.4823 KOps/s | |
test_unsqueeze | 0.1736ms | 92.6865μs | 10.7891 KOps/s | 10.4051 KOps/s | |
test_split | 0.3722ms | 0.1969ms | 5.0794 KOps/s | 5.0275 KOps/s | |
test_permute | 0.3572ms | 0.2054ms | 4.8687 KOps/s | 4.7458 KOps/s | |
test_stack | 30.2638ms | 26.7301ms | 37.4110 Ops/s | 37.6349 Ops/s | |
test_cat | 30.2103ms | 26.2914ms | 38.0352 Ops/s | 38.1020 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 26.5610μs | 11.6595μs | 85.7667 KOps/s | 76.1461 KOps/s | |
test_plain_set_stack_nested | 0.1924ms | 11.8112μs | 84.6655 KOps/s | 74.9771 KOps/s | |
test_plain_set_nested_inplace | 36.5210μs | 12.6585μs | 78.9981 KOps/s | 70.5476 KOps/s | |
test_plain_set_stack_nested_inplace | 0.2023ms | 12.7162μs | 78.6398 KOps/s | 70.7106 KOps/s | |
test_items | 0.1660ms | 2.8645μs | 349.0988 KOps/s | 348.7121 KOps/s | |
test_items_nested | 0.4200ms | 0.3677ms | 2.7197 KOps/s | 2.7512 KOps/s | |
test_items_nested_locked | 0.4193ms | 0.3661ms | 2.7317 KOps/s | 2.7450 KOps/s | |
test_items_nested_leaf | 0.1621ms | 59.1683μs | 16.9009 KOps/s | 17.2142 KOps/s | |
test_items_stack_nested | 0.4013ms | 0.3674ms | 2.7218 KOps/s | 2.7717 KOps/s | |
test_items_stack_nested_leaf | 90.2210μs | 59.1692μs | 16.9007 KOps/s | 16.9869 KOps/s | |
test_items_stack_nested_locked | 0.5221ms | 0.3669ms | 2.7255 KOps/s | 2.7561 KOps/s | |
test_keys | 29.6000μs | 3.4617μs | 288.8770 KOps/s | 289.1305 KOps/s | |
test_keys_nested | 0.2348ms | 87.4397μs | 11.4365 KOps/s | 11.4401 KOps/s | |
test_keys_nested_locked | 0.7046ms | 93.9984μs | 10.6385 KOps/s | 10.8544 KOps/s | |
test_keys_nested_leaf | 0.1111ms | 78.2400μs | 12.7812 KOps/s | 12.7972 KOps/s | |
test_keys_stack_nested | 0.1267ms | 87.8946μs | 11.3773 KOps/s | 11.4029 KOps/s | |
test_keys_stack_nested_leaf | 0.1202ms | 80.1749μs | 12.4727 KOps/s | 12.6875 KOps/s | |
test_keys_stack_nested_locked | 0.1206ms | 93.4760μs | 10.6979 KOps/s | 10.6973 KOps/s | |
test_values | 6.1735μs | 0.8519μs | 1.1738 MOps/s | 1.1773 MOps/s | |
test_values_nested | 88.5310μs | 37.4051μs | 26.7343 KOps/s | 26.8456 KOps/s | |
test_values_nested_locked | 63.5910μs | 38.7891μs | 25.7804 KOps/s | 25.6073 KOps/s | |
test_values_nested_leaf | 82.9110μs | 42.5438μs | 23.5052 KOps/s | 24.1999 KOps/s | |
test_values_stack_nested | 0.1393ms | 37.3135μs | 26.7999 KOps/s | 26.3544 KOps/s | |
test_values_stack_nested_leaf | 0.2029ms | 42.7162μs | 23.4103 KOps/s | 23.8894 KOps/s | |
test_values_stack_nested_locked | 84.3520μs | 39.3817μs | 25.3925 KOps/s | 25.4445 KOps/s | |
test_membership | 10.2942μs | 0.5086μs | 1.9660 MOps/s | 1.9489 MOps/s | |
test_membership_nested | 47.5110μs | 2.0982μs | 476.6001 KOps/s | 508.1170 KOps/s | |
test_membership_nested_leaf | 19.1155μs | 2.0330μs | 491.8911 KOps/s | 498.5894 KOps/s | |
test_membership_stacked_nested | 0.1169ms | 2.1327μs | 468.8916 KOps/s | 485.8798 KOps/s | |
test_membership_stacked_nested_leaf | 34.8710μs | 2.0896μs | 478.5626 KOps/s | 480.9205 KOps/s | |
test_membership_nested_last | 35.3410μs | 3.0842μs | 324.2355 KOps/s | 330.3923 KOps/s | |
test_membership_nested_leaf_last | 42.9110μs | 3.0903μs | 323.5927 KOps/s | 331.9305 KOps/s | |
test_membership_stacked_nested_last | 44.9310μs | 8.1979μs | 121.9820 KOps/s | 122.4256 KOps/s | |
test_membership_stacked_nested_leaf_last | 50.1900μs | 8.1506μs | 122.6909 KOps/s | 122.9178 KOps/s | |
test_nested_getleaf | 43.8910μs | 6.2662μs | 159.5863 KOps/s | 161.8736 KOps/s | |
test_nested_get | 33.8910μs | 5.8440μs | 171.1153 KOps/s | 170.3131 KOps/s | |
test_stacked_getleaf | 31.6100μs | 6.1484μs | 162.6434 KOps/s | 162.2188 KOps/s | |
test_stacked_get | 32.0500μs | 5.8219μs | 171.7642 KOps/s | 169.8979 KOps/s | |
test_nested_getitemleaf | 41.9800μs | 6.5278μs | 153.1906 KOps/s | 152.8297 KOps/s | |
test_nested_getitem | 32.4400μs | 6.1698μs | 162.0794 KOps/s | 162.2102 KOps/s | |
test_stacked_getitemleaf | 47.5110μs | 6.4342μs | 155.4203 KOps/s | 155.4189 KOps/s | |
test_stacked_getitem | 25.7210μs | 6.0812μs | 164.4404 KOps/s | 163.5958 KOps/s | |
test_lock_nested | 10.6896ms | 0.3475ms | 2.8776 KOps/s | 2.8774 KOps/s | |
test_lock_stack_nested | 0.4164ms | 0.3337ms | 2.9963 KOps/s | 2.9593 KOps/s | |
test_unlock_nested | 0.3902ms | 0.2808ms | 3.5617 KOps/s | 3.5789 KOps/s | |
test_unlock_stack_nested | 0.4212ms | 0.2732ms | 3.6597 KOps/s | 3.6323 KOps/s | |
test_flatten_speed | 0.2716ms | 75.2246μs | 13.2935 KOps/s | 13.3286 KOps/s | |
test_unflatten_speed | 0.3712ms | 0.3243ms | 3.0832 KOps/s | 3.1023 KOps/s | |
test_common_ops | 0.8129ms | 0.5800ms | 1.7242 KOps/s | 1.5795 KOps/s | |
test_creation | 0.1324ms | 1.7441μs | 573.3749 KOps/s | 561.6233 KOps/s | |
test_creation_empty | 29.8200μs | 7.0072μs | 142.7098 KOps/s | 97.7240 KOps/s | |
test_creation_nested_1 | 32.6510μs | 8.6650μs | 115.4065 KOps/s | 84.1102 KOps/s | |
test_creation_nested_2 | 44.6010μs | 11.4788μs | 87.1173 KOps/s | 67.7708 KOps/s | |
test_clone | 48.1110μs | 9.6487μs | 103.6409 KOps/s | 102.8347 KOps/s | |
test_getitem[int] | 1.6457ms | 10.8567μs | 92.1089 KOps/s | 93.1885 KOps/s | |
test_getitem[slice_int] | 0.1036ms | 20.8226μs | 48.0247 KOps/s | 48.2381 KOps/s | |
test_getitem[range] | 0.1456ms | 37.8228μs | 26.4391 KOps/s | 28.2028 KOps/s | |
test_getitem[tuple] | 0.1174ms | 18.2122μs | 54.9083 KOps/s | 55.0781 KOps/s | |
test_getitem[list] | 0.1619ms | 32.3594μs | 30.9030 KOps/s | 32.0153 KOps/s | |
test_setitem_dim[int] | 49.2210μs | 18.7800μs | 53.2480 KOps/s | 55.8792 KOps/s | |
test_setitem_dim[slice_int] | 0.1591ms | 39.0516μs | 25.6071 KOps/s | 27.7329 KOps/s | |
test_setitem_dim[range] | 0.1520ms | 54.2238μs | 18.4421 KOps/s | 19.6354 KOps/s | |
test_setitem_dim[tuple] | 51.3210μs | 31.0332μs | 32.2236 KOps/s | 32.4277 KOps/s | |
test_setitem | 0.1027ms | 13.3201μs | 75.0744 KOps/s | 66.2126 KOps/s | |
test_set | 48.4110μs | 12.9229μs | 77.3820 KOps/s | 67.8104 KOps/s | |
test_set_shared | 0.5184ms | 0.1631ms | 6.1317 KOps/s | 6.2300 KOps/s | |
test_update | 0.4082ms | 15.6748μs | 63.7967 KOps/s | 53.3912 KOps/s | |
test_update_nested | 82.9110μs | 20.9069μs | 47.8310 KOps/s | 41.8968 KOps/s | |
test_update__nested | 0.5977ms | 24.8209μs | 40.2886 KOps/s | 42.1511 KOps/s | |
test_set_nested | 0.1393ms | 14.2748μs | 70.0533 KOps/s | 64.0563 KOps/s | |
test_set_nested_new | 59.6810μs | 16.4607μs | 60.7509 KOps/s | 55.8377 KOps/s | |
test_select | 0.2068ms | 28.7931μs | 34.7306 KOps/s | 33.4720 KOps/s | |
test_select_nested | 0.1354ms | 43.6036μs | 22.9339 KOps/s | 22.8750 KOps/s | |
test_exclude_nested | 0.1010ms | 61.9920μs | 16.1311 KOps/s | 15.7729 KOps/s | |
test_empty[True] | 0.3946ms | 0.2938ms | 3.4042 KOps/s | 3.3700 KOps/s | |
test_empty[False] | 3.5950μs | 0.8281μs | 1.2077 MOps/s | 1.1889 MOps/s | |
test_to | 85.3510μs | 54.3566μs | 18.3970 KOps/s | 17.3075 KOps/s | |
test_to_nonblocking | 0.1965ms | 46.4467μs | 21.5301 KOps/s | 21.9680 KOps/s | |
test_unbind_speed | 0.2699ms | 0.2431ms | 4.1129 KOps/s | 4.1974 KOps/s | |
test_unbind_speed_stack0 | 0.3789ms | 0.2380ms | 4.2008 KOps/s | 4.2683 KOps/s | |
test_unbind_speed_stack1 | 0.1040s | 0.7278ms | 1.3740 KOps/s | 1.3685 KOps/s | |
test_split | 1.6023ms | 1.4738ms | 678.5128 Ops/s | 628.8453 Ops/s | |
test_chunk | 0.1038s | 1.8041ms | 554.2990 Ops/s | 623.6865 Ops/s | |
test_consolidate[False-None] | 3.1765ms | 2.6816ms | 372.9133 Ops/s | 371.2272 Ops/s | |
test_consolidate[default-None] | 1.8554ms | 1.7264ms | 579.2408 Ops/s | 586.7769 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.9076ms | 1.7495ms | 571.5841 Ops/s | 572.9154 Ops/s | |
test_consolidate_njt[False-None] | 7.0733ms | 6.5374ms | 152.9650 Ops/s | 106.8957 Ops/s | |
test_to[False-False-None] | 0.3283s | 2.2053ms | 453.4439 Ops/s | 602.7278 Ops/s | |
test_to[True-False-None] | 1.5004ms | 1.3128ms | 761.7025 Ops/s | 748.2362 Ops/s | |
test_to[within-False-None] | 4.2741ms | 4.0820ms | 244.9761 Ops/s | 237.9109 Ops/s | |
test_to[True-default-None] | 5.4376ms | 5.1816ms | 192.9920 Ops/s | 186.9728 Ops/s | |
test_to_njt[False-False-None] | 7.0994ms | 6.7885ms | 147.3085 Ops/s | 137.6355 Ops/s | |
test_to_njt[True-False-None] | 5.8592ms | 5.5482ms | 180.2389 Ops/s | 170.1395 Ops/s | |
test_to_njt[within-False-None] | 12.7214ms | 12.4262ms | 80.4752 Ops/s | 80.6149 Ops/s | |
test_creation[device0] | 0.4604ms | 81.0372μs | 12.3400 KOps/s | 12.4446 KOps/s | |
test_creation_from_tensor | 0.5300ms | 84.0601μs | 11.8962 KOps/s | 11.9669 KOps/s | |
test_add_one[memmap_tensor0] | 0.2265ms | 6.2665μs | 159.5795 KOps/s | 161.8172 KOps/s | |
test_contiguous[memmap_tensor0] | 2.0941μs | 0.4654μs | 2.1487 MOps/s | 2.3351 MOps/s | |
test_stack[memmap_tensor0] | 0.1503ms | 4.5838μs | 218.1595 KOps/s | 226.9568 KOps/s | |
test_memmaptd_index | 1.9222ms | 0.2436ms | 4.1048 KOps/s | 4.1517 KOps/s | |
test_memmaptd_index_astensor | 0.4398ms | 0.3015ms | 3.3172 KOps/s | 3.3349 KOps/s | |
test_memmaptd_index_op | 0.6641ms | 0.5294ms | 1.8888 KOps/s | 1.6970 KOps/s | |
test_serialize_model | 0.1325s | 0.1308s | 7.6424 Ops/s | 7.6459 Ops/s | |
test_serialize_model_pickle | 1.3470s | 1.1909s | 0.8397 Ops/s | 0.8238 Ops/s | |
test_serialize_weights | 0.1321s | 0.1302s | 7.6793 Ops/s | 7.6829 Ops/s | |
test_serialize_weights_returnearly | 0.4004s | 64.3888ms | 15.5306 Ops/s | 14.1959 Ops/s | |
test_serialize_weights_pickle | 1.3768s | 1.2238s | 0.8171 Ops/s | 0.8204 Ops/s | |
test_reshape_pytree | 85.8810μs | 22.4938μs | 44.4568 KOps/s | 44.8777 KOps/s | |
test_reshape_td | 0.1981ms | 27.2913μs | 36.6417 KOps/s | 36.7647 KOps/s | |
test_view_pytree | 0.1823ms | 22.1382μs | 45.1709 KOps/s | 45.7035 KOps/s | |
test_view_td | 0.1761ms | 30.4162μs | 32.8772 KOps/s | 30.8426 KOps/s | |
test_unbind_pytree | 0.1851ms | 27.2728μs | 36.6666 KOps/s | 35.9689 KOps/s | |
test_unbind_td | 0.5642ms | 36.4439μs | 27.4394 KOps/s | 27.6511 KOps/s | |
test_split_pytree | 0.1533ms | 30.2141μs | 33.0971 KOps/s | 33.4747 KOps/s | |
test_split_td | 0.7565ms | 39.2163μs | 25.4996 KOps/s | 25.5891 KOps/s | |
test_add_pytree | 0.2331ms | 31.6634μs | 31.5822 KOps/s | 30.3381 KOps/s | |
test_add_td | 0.2104ms | 41.8778μs | 23.8790 KOps/s | 19.7922 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2744ms | 0.1210ms | 8.2662 KOps/s | 7.8169 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2983ms | 0.1300ms | 7.6951 KOps/s | 7.6435 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2461ms | 96.2670μs | 10.3878 KOps/s | 10.2420 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.0725ms | 0.1443ms | 6.9303 KOps/s | 6.8979 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1623ms | 24.2608μs | 41.2188 KOps/s | 43.5089 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1729ms | 29.9826μs | 33.3527 KOps/s | 33.6286 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3830ms | 64.4850μs | 15.5075 KOps/s | 15.3648 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1503ms | 49.1111μs | 20.3620 KOps/s | 20.2680 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3035ms | 0.1431ms | 6.9897 KOps/s | 7.0858 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4028ms | 0.2164ms | 4.6202 KOps/s | 4.6552 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2619ms | 0.1057ms | 9.4583 KOps/s | 10.3373 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2166ms | 57.3166μs | 17.4470 KOps/s | 17.4561 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2865ms | 0.1415ms | 7.0654 KOps/s | 7.4503 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6742ms | 0.4705ms | 2.1252 KOps/s | 2.1524 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4274ms | 0.2658ms | 3.7629 KOps/s | 3.8891 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3532ms | 0.1479ms | 6.7599 KOps/s | 7.1256 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2850ms | 69.4916μs | 14.3902 KOps/s | 14.7459 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2922ms | 0.1040ms | 9.6118 KOps/s | 10.2553 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5949ms | 0.3980ms | 2.5127 KOps/s | 2.5731 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2925ms | 0.1419ms | 7.0471 KOps/s | 7.5981 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1707ms | 18.4188μs | 54.2925 KOps/s | 56.3553 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 75.9320μs | 31.2321μs | 32.0183 KOps/s | 31.9595 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2154ms | 70.2447μs | 14.2360 KOps/s | 14.0035 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1444ms | 51.3154μs | 19.4873 KOps/s | 19.1807 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6708ms | 0.3981ms | 2.5122 KOps/s | 2.1964 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9946ms | 2.5407ms | 393.5899 Ops/s | 398.9089 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6467ms | 0.4469ms | 2.2376 KOps/s | 2.2890 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.9260ms | 2.5348ms | 394.5100 Ops/s | 394.5627 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5539ms | 0.1100ms | 9.0879 KOps/s | 8.6686 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5630ms | 76.4597μs | 13.0788 KOps/s | 12.3022 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.7775ms | 0.1037ms | 9.6420 KOps/s | 9.2876 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.4638ms | 64.7747μs | 15.4381 KOps/s | 14.4648 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.5308ms | 0.1067ms | 9.3698 KOps/s | 9.1714 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.4903ms | 67.4838μs | 14.8184 KOps/s | 14.4521 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.5252ms | 0.1029ms | 9.7176 KOps/s | 9.9295 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4161ms | 17.3015μs | 57.7985 KOps/s | 58.1223 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2591ms | 95.2835μs | 10.4950 KOps/s | 10.3779 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.4337ms | 15.5707μs | 64.2231 KOps/s | 63.4361 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.5021ms | 97.4876μs | 10.2577 KOps/s | 10.2916 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.4099ms | 15.6393μs | 63.9417 KOps/s | 63.6234 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.5074ms | 0.1054ms | 9.4915 KOps/s | 9.8981 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5921ms | 16.8020μs | 59.5166 KOps/s | 58.9018 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.5246ms | 99.1672μs | 10.0840 KOps/s | 10.3321 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.4174ms | 15.5666μs | 64.2402 KOps/s | 64.5744 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.5028ms | 97.2839μs | 10.2792 KOps/s | 10.3556 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4328ms | 18.4452μs | 54.2148 KOps/s | 63.9858 KOps/s | |
test_mod_add[eager] | 0.4440ms | 36.4856μs | 27.4080 KOps/s | 25.1952 KOps/s | |
test_mod_add[compile] | 0.4860ms | 80.2764μs | 12.4570 KOps/s | 12.3893 KOps/s | |
test_mod_add[compile-overhead] | 0.3287ms | 0.1915ms | 5.2220 KOps/s | 5.5423 KOps/s | |
test_mod_wrap[eager] | 0.6496ms | 0.2409ms | 4.1511 KOps/s | 4.0432 KOps/s | |
test_mod_wrap[compile] | 0.4501ms | 0.2813ms | 3.5543 KOps/s | 3.5152 KOps/s | |
test_mod_wrap[compile-overhead] | 7.0198ms | 3.7499ms | 266.6731 Ops/s | 270.5030 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6327ms | 1.4379ms | 695.4624 Ops/s | 706.4862 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5319ms | 1.3565ms | 737.2017 Ops/s | 742.4789 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4844ms | 0.9385ms | 1.0655 KOps/s | 1.0686 KOps/s | |
test_seq_add[eager] | 0.3180ms | 0.1130ms | 8.8502 KOps/s | 8.5873 KOps/s | |
test_seq_add[compile] | 0.2609ms | 88.5577μs | 11.2921 KOps/s | 11.2690 KOps/s | |
test_seq_add[compile-overhead] | 0.3041ms | 0.1284ms | 7.7856 KOps/s | 7.6898 KOps/s | |
test_seq_wrap[eager] | 0.5919ms | 0.4071ms | 2.4567 KOps/s | 2.3617 KOps/s | |
test_seq_wrap[compile] | 0.4646ms | 0.2961ms | 3.3768 KOps/s | 3.2204 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3925ms | 0.2247ms | 4.4494 KOps/s | 4.4173 KOps/s | |
test_func_call_runtime[False-eager] | 0.8810ms | 0.7038ms | 1.4209 KOps/s | 1.3960 KOps/s | |
test_func_call_runtime[False-compile] | 1.0019ms | 0.7478ms | 1.3372 KOps/s | 1.3446 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5060ms | 0.3644ms | 2.7441 KOps/s | 2.7290 KOps/s | |
test_func_call_runtime[True-eager] | 1.0863ms | 0.8837ms | 1.1316 KOps/s | 1.1477 KOps/s | |
test_func_call_runtime[True-compile] | 0.9706ms | 0.7661ms | 1.3053 KOps/s | 1.3131 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5126ms | 0.3854ms | 2.5945 KOps/s | 2.5981 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8899ms | 0.7052ms | 1.4181 KOps/s | 1.3280 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9231ms | 0.7476ms | 1.3375 KOps/s | 1.3520 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5531ms | 0.3682ms | 2.7158 KOps/s | 2.7261 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1878ms | 0.9899ms | 1.0102 KOps/s | 1.0297 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1395ms | 0.9617ms | 1.0398 KOps/s | 1.0400 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1150ms | 0.9558ms | 1.0462 KOps/s | 1.0410 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4441ms | 2.0229ms | 494.3329 Ops/s | 495.5169 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0046ms | 0.8176ms | 1.2231 KOps/s | 1.2121 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5671ms | 0.4181ms | 2.3915 KOps/s | 2.3660 KOps/s | |
test_distributed | 5.4416ms | 0.2222ms | 4.5013 KOps/s | 7.4311 KOps/s | |
test_tdmodule | 55.6810μs | 18.3737μs | 54.4256 KOps/s | 45.5289 KOps/s | |
test_tdmodule_dispatch | 0.1622ms | 33.2196μs | 30.1027 KOps/s | 26.3342 KOps/s | |
test_tdseq | 40.7810μs | 19.8491μs | 50.3800 KOps/s | 45.7360 KOps/s | |
test_tdseq_dispatch | 0.1211ms | 37.0455μs | 26.9938 KOps/s | 24.2692 KOps/s | |
test_instantiation_functorch | 1.7174ms | 1.5418ms | 648.5952 Ops/s | 647.7303 Ops/s | |
test_exec_functorch | 0.3464ms | 0.1407ms | 7.1088 KOps/s | 7.0309 KOps/s | |
test_exec_functional_call | 0.3146ms | 0.1295ms | 7.7239 KOps/s | 7.6060 KOps/s | |
test_exec_td_decorator | 0.3869ms | 0.1812ms | 5.5186 KOps/s | 5.4296 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8277ms | 0.6587ms | 1.5182 KOps/s | 1.5051 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8298ms | 0.6608ms | 1.5134 KOps/s | 1.4932 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7384ms | 0.5743ms | 1.7411 KOps/s | 1.7403 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7688ms | 0.5742ms | 1.7417 KOps/s | 1.7491 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 18.7637ms | 18.4827ms | 54.1045 Ops/s | 54.0444 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.5438ms | 18.6291ms | 53.6794 Ops/s | 54.3141 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.3319ms | 18.4292ms | 54.2618 Ops/s | 54.8432 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.4312ms | 18.5507ms | 53.9062 Ops/s | 54.6281 Ops/s | |
test_to_module_speed[True] | 1.0843ms | 0.9614ms | 1.0401 KOps/s | 1.0470 KOps/s | |
test_to_module_speed[False] | 1.4265ms | 0.9535ms | 1.0487 KOps/s | 1.0584 KOps/s | |
test_tc_init | 0.1089ms | 35.3577μs | 28.2824 KOps/s | 27.1504 KOps/s | |
test_tc_init_nested | 99.9220μs | 69.7618μs | 14.3345 KOps/s | 13.5579 KOps/s | |
test_tc_first_layer_tensor | 29.5200μs | 0.8299μs | 1.2049 MOps/s | 1.2122 MOps/s | |
test_tc_first_layer_nontensor | 25.0710μs | 2.2700μs | 440.5212 KOps/s | 436.9573 KOps/s | |
test_tc_second_layer_tensor | 10.5303μs | 1.4324μs | 698.1272 KOps/s | 698.2298 KOps/s | |
test_tc_second_layer_nontensor | 26.0300μs | 3.0084μs | 332.4081 KOps/s | 332.8712 KOps/s | |
test_unbind | 0.2465s | 10.7915ms | 92.6657 Ops/s | 142.4595 Ops/s | |
test_full_like | 11.9676ms | 10.4247ms | 95.9262 Ops/s | 94.0270 Ops/s | |
test_zeros_like | 5.1306ms | 4.6014ms | 217.3268 Ops/s | 215.9141 Ops/s | |
test_ones_like | 5.5506ms | 4.7100ms | 212.3121 Ops/s | 213.5197 Ops/s | |
test_clone | 13.1741ms | 10.0183ms | 99.8172 Ops/s | 131.2426 Ops/s | |
test_squeeze | 0.1003ms | 9.6537μs | 103.5875 KOps/s | 105.9075 KOps/s | |
test_unsqueeze | 0.2196ms | 72.3016μs | 13.8309 KOps/s | 14.1280 KOps/s | |
test_split | 0.2713ms | 0.1573ms | 6.3577 KOps/s | 6.4115 KOps/s | |
test_permute | 0.3021ms | 0.1733ms | 5.7700 KOps/s | 5.7062 KOps/s | |
test_stack | 53.9251ms | 51.7830ms | 19.3113 Ops/s | 18.6483 Ops/s | |
test_cat | 54.5988ms | 52.4317ms | 19.0724 Ops/s | 19.1286 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):